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"Core Structure of gp41 from the HIV Envelope Glycoprotein," by David C Chan, 
5 Deborah Fass, Min Lu, James M. Berger and Peter S. Kim, filed July 18, 2002, which is 
a continuation of U.S. Application No. 09/484,925, entitled, "Core Structure of gp41 
From the HIV Envelope Glycoprotein", by David C. Chan, Deborah Fass, Min Lu, 
James M. Berger and Peter S. Kim, filed January 18, 2000, which is a Divisional 
Application of 09/062,241, entitled, "Core Structure of gp41 From the HIV Envelope 

10 Glycoprotein", by David C. Chan, Deborah Fass, Min Lu, James M. Berger and Peter S. 
Kim, (filed on April 17, 1998) now U.S. Patent No. 6,150,088 (issued November 21, 
2000), which claims the benefit of U.S. Provisional Application 60/043,280, entitled 
"Core Structure of gp41 from the HIV Envelope Glycoprotein", by David C. Chan, 
Deborah Fass, Min Lu, James M. Berger and Peter S. Kim (filed April 17, 1997). The 

15 entire teachings of the above applications are incorporated herein by reference. 
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BACKGROUND OF THE INVENTION 

The surface glycoproteins of enveloped viruses play critical roles in the initial 
events of viral infection, mediating virion attachment to cells and fusion of the viral and 
immune response in infected hosts. Envelope glycoproteins are also major targets for 
5 the anti-viral immune response in infected hosts. The human immunodeficiency virus 
type 1 (HIV-1) envelope glycoprotein consists of two noncovalently associated subunits, 
gpl20 and gp41, that are generated by proteolytic cleavage of a precursor polypeptide, 
gpl60. Luciw, P.A., In Fields Virology, Third Edition, B. N. Fields et ai, eds., 
Lippincott-Raven Publishers, Philadelphia, pp. 1881-1952 (1996); Freed, E. O. et al. 9 J. 

10 Biol Chem. 270: 23883-23886 (1995). gpl20 directs target-cell recognition and viral 
tropism through interaction with the cell-surface receptor CD4 and one of several 
co-receptors that are members of the chemokine receptor family. Broder, C. C. et al, 
Pathobiology 5*171-179 (1996); D'Souza, M. P. et al. 9 Nature Med. 2:1293-1300 
(1996); Wilkinson, D., Current Biology (5:1051-1053 (1996). The membrane-spanning 

15 gp41 subunit then promotes fusion of the viral and cellular membranes, a process that 
results in the release of viral contents into the host cell. It has not yet been possible to 
obtain a detailed structure for gp41, either alone or in complex with gpl20. 

SUMMARY OF THE INVENTION 

Described herein is the crystal structure of the a-helical domain of the gp41 

20 component of HIV-1 envelope glycoprotein which represents the core of fusion- active 
gp41. Also described herein is Applicants* determination, with reference to the crystal 
structure, that certain amino acid residues within the core are essential for interaction of 
the component peptides and, thus, for gp41 activity. The core of fusion-active gp41 is 
composed of a trimer of two interacting peptides, referred to here as N36 and C34. The 

25 minimal stable envelope subdomain has been shown to consist of a 36-residue peptide 
(N-36) and a 34-residue peptide (C-34) whose amino acid sequences are presented 
below. The crystal structure of the N36/C34 complex is a six-helix bundle in which 
three N36 helices form an interior, parallel coiled coil and three C34 helices pack in an 
oblique, anti-parallel manner into highly conserved, hydrophobic grooves on the surface 
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of the N36 trimer. It shows striking similarity to the low-pH induced conformation of 
influenza hemagglutinin (HA). 

Applicants have determined the structural basis for interaction between two 
peptide fragments of HIV gp41 : one peptide fragment derived from the N-terminal 
5 region of the ectodomain of gp41 and one peptide fragment derived from the C-terminal 
region of the gp41 ectodomain. The N-terminal peptide fragment, N36, includes amino 
acid residue 546 through and including amino acid residue 581, numbered according to 
their position in HIV-1 gpl60; it includes amino acid residues which comprise a region 
of the ectodomain which encompasses the 4-3 hydrophobic repeat. The amino acid 

10 sequence of the N36 peptide is: 

SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQRIL (SEQ ID NO.: 1). 
The C-terminal region peptide fragment C34 includes amino acid residue 628 through 
and including amino acid residue 661, numbered according to their position in HIV-1 
gpl60; it is derived from the region prior to the transmembrane segment. The amino 

15 acid sequence of the C34 peptide is: 

WMEWDREINNYTSLIHSLIEESQNQQEKNEQELL (SEQ ID NO.: 2). The three- 
dimension coordinates for the atoms in the N36/C34 gp41 complex are presented 
herein. They can be used to display the structure of the complex and to design 
molecules (drugs) which interact with gp41 and inhibit its activity, such as those which 

20 prevent interaction of key components (amino acid residues) of the a-helical domain 
which represents the core of fusion-active gp41. 

Work described herein provides, for the first time, an understanding of how the 
N-terminal peptide and the C-terminal peptide interact. The crystal structure and 
information regarding the interactions of these two peptides provide the basis for 

25 development of drugs which inhibit HIV infection, such as peptidomimetic or small- 
molecule inhibitors, using such methods as combinatorial chemistry or rational drug 
design. Drugs developed or identified with reference to the information provided herein 
are also the subject of the present invention. Drugs which fit into or line the N-peptide 
cavity, prevent the N-peptide cavity from accommodating amino acid residues or 

30 peptides from the C-terminal region of gp41 and, thus, prevent or inhibit gp41 activity 
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are the subject of this invention. Such drugs can be identified with reference to the 
information about the structure of the complex and the cavity shown to be present in the 
N36 trimer, provided herein, or with reference to information about the structure of the 
complex and the three dimensional coordinates of the cavity, also provided herein, and 
5 known methods. In a particular embodiment of identifying or designing a molecule 
which inhibits the fusion active form of gp41 and, thus, inhibit HIV, in which 
combinatorial chemistry is used, a library biased to include an increased number of 
indole rings, hydrophobic moieties and/or negatively charged molecules is used. An 
antibody which binds these key areas of fusion-active gp41 is also the subject of the 
10 invention. For example, an immunogen which is or includes a molecule with the 
coordinates described herein or the N-peptide core can be used to immunize an 
individual, resulting in production of antibodies that bind the cavity or pocket on the N- 
terminal peptide and, thus, render it unavailable for its normal interactions and prevent 
or inhibit gp41 activity. 



1 5 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic view of gp41 showing important functional regions, 
including the 4-3 hydrophobic repeat, the fusion peptide (fp), a disulfide linkage (S-S), 
and the transmembrane region (tm). The ectodomain is drawn approximately to scale. 
The peptides identified by protein dissection are shown above, along with the sequences 

20 of N36 and C34. The residues are numbered according to their position in gpl60. 

Figure 2 is a representative portion of the initial electron density map calculated 
using experimental structure- factor amplitudes and solvent-flattened MAD phases, 
shown with the refined molecular model. The map is contoured at 1.5 standard 
deviations above the mean density. The figure was generated with the program O 

25 (Jones, T. A., and Kjeldgaard, M., O The Manual, Uppsala, Sweden: 
http://kaktus.kemi.aau.dk (1992)). 

Figures 3 A and 3B present overall views of the N36/C34 complex. Figure 3 A 
shows an end-on view of the N36/C34 complex looking down the three-fold axis of the 
trimer. Figure 3B shows a side view with one N36 and one C34 helix labeled. The 
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amino termini of the N36 helices (grey) point towards the top of the page, while those of 
the C34 helices (black) point towards the bottom. Diagrams were prepared using the 
program MOLSCRIPT (Kraulis, P., J. Appl Cryst. 24:924-950 (1991)). 

Figure 4 shows a helical wheel representation of N36 and C34; three N36 helices 
5 and one C34 helix are represented as helical wheel projections. The view is from the 
top of the complex, as in Figure 3 A. The residues at each position are represented by 
the single-letter codes for amino acids. The N36 helices interact through 
"knobs-into-holes" packing interactions at the a and d positions. Positions of the N36 
and C34 helices that occupy the interhelical space between two N36 helices and a C34 
10 helix are shown (arrows). The helical wheel positions in C34 are indicated by ellipses 
to represent the oblique orientation of this helix relative to N36. At the top of the 
complex, C34 is slightly tilted towards the left N36 helix, while at the bottom of the 
complex, it is slightly tilted towards the right N36 helix. 

Figures 5 A-5P present the three-dimension coordinates for the atoms in the 
15 N36/C34 gp41 complex; the atom types (column 3) in each amino acid (column 4) are 
listed, along with their coordinates (columns 6, 7, 8) in space. The three-dimension 
coordinates can be used to display the structure of the N36/C34 complex. The 
coordinates are available from the Protein Data Bank at the Brookhaven National 
Laboratory. 

20 Figure 6 represents the distances (in A) between the atoms in the four amino acid 

residues of C34 that dock into the cavity on the N36 trimer surface. The two tryptophan 
residues, and the isoleucine residue and the aspartic acid residue are indicated in green. 

DETAILED DESCRIPTION OF THE INVENTION 

For the first time, a high-resolution picture of the protein fragment that enables 
25 HIV to invade human cells has been produced. As described, Applicants have 

determined the crystal structure of a key fragment of the HIV envelope protein. The 
envelope protein resides on the surface of the virus and plays a crucial role in HIV 
infection. One part of the protein, known as gpl20, allows the virus to bind to human 
cells. Another subunit, gp41, mediates fusion of the viral membrane and the cell 



0399.1167-007 



-6- 



membrane - it initiates entry of the virus into the cell. The core structure of gp41 has 
been determined using X-ray crystallography. 

The images of the protein fragment reveal a compact, six-helix bundle 
punctuated by deep cavities which are key targets for the development of new antiviral 
5 drugs. The existence of the cavities could not have been determined without the 
images. 

Despite its importance, there are no antiviral drugs that target the envelope 
protein of HIV, in part because the virus is extraordinarily clever at changing the pieces 
of the protein it presents to the outside world. Work presented herein shows that the 

10 cavity structure may not be so amenable to change; therefore, drugs directed towards 
this region are useful against many HIV strains. 

The HIV fusion protein has characteristics similar to those of the fusion structure 
of influenza virus. Surprisingly, the HIV fusion protein has a deep cavity or pocket at 
the base of each groove in the N36 coiled coil. In the active structure, each cavity is 

1 5 filled by a knob-like protrusion from C34. This ball-and-socket arrangement of C34 
and N36 is a target for drug design or discovery. The structure, combined with data 
from other laboratories, supports the idea that a small molecule constructed specifically 
to block this interaction will stop fusion and prevent the virus from entering cells. 
There are at least three reasons why such a molecule would be effective in 

20 preventing HIV from entering cells. First, test tube studies have shown that fragments, 
or peptides, of gp41 encompassing or overlapping with N36 or C34 have potent anti- 
viral activity. However, peptides generally make poor drugs because they are poorly 
absorbed and the body breaks them down almost immediately. A small molecule 
targeting just the cavity structure could escape this fate. 

25 Second, the inhibitors derived from the C and N peptides are effective in the test 

tube against a wide range of HIV strains, including patient isolates and laboratory- 
adapted strains. By contrast, neutralizing antibodies and drug candidates designed to 
block the binding activity of the envelope protein are typically effective against only a 
limited subset of HIV strains. 
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Third, alteration of the walls of the N36 cavity can block the fusion reaction, 
indicating that the ball-and-socket arrangement of N36 and C34 must be preserved to 
obtain viral infection. In addition, the protein building blocks that make up the walls 
are highly conserved among HIV strains and between HIV and SIV, the virus 
5 responsible for AIDS in monkeys. This suggests that the virus cannot tolerate much 
change in this region and that HIV may have more difficulty developing resistance to a 
cavity-blocking drug than to many other compounds. 

Applicants have analyzed the crystal structure of the a -helical domain of the 
HIV-1 transmembrane protein gp41 by means of assessment of a complex, referred to 

10 herein as the N36/C34 complex, which is composed of two interacting peptides: N36, 
which is derived from the N- terminal region of the gp41 ectodomain and C34, which is 
derived from the C-terminal region of the gp41 ectodomain. As described herein, 
Applicants have shown that the N36/C34 complex is a six-helix bundle (Figure 3), in 
which the center consists of a parallel, trimeric coiled-coil of three N36 helices wrapped 

15 in a gradual left-handed superhelix. Three C34 helices wrap antiparallel to the N36 
helices in a left-handed direction around the outside of the central coiled-coil N36 
trimer. The N36/C34 complex is a cylinder which is approximately 35 A in diameter 
and approximately 55 A in height. Figure 4 is a helical wheel representations of N36 
and C34 in which three N36 helices and one C34 helix are represented as helical wheel 

20 projections. As can be seen, the interior amino acid residues at the a and d positions of 
the N36 heptad repeat are predominately hydrophobic (isoleucine, leucine). The 
characteristic "knobs-into-holes" packing of coiled coils occurs in the N36 trimer. That 
is, the amino acid residues (knobs) at the a and d layers pack into cavities (holes) 
between four residues of an adjacent helix. Crick, F.H.C., Acta. Cryst, 6: 689-697 

25 (1953); O'Shea, E.K., et al, Science, 254:539-544 (1991). Further description of the 
N36 trimer is presented in Example 2. 

An electrostatic potential map of the cylindrical N36 superhelix shows that the 
surface of the superhelix is largely uncharged. The grooves that are the sites for C34 
interaction have been determined to be lined with predominantly hydrophobic amino 

30 acid residues. The surface of the N36/C34 complex is much more highly charged than 
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the isolated N-pep tides, due to the acidic residues on the outside of the C34 helices. 
This explains why the heterodimeric complex exhibits greater solubility than the 
isolated peptides. 

Three C34 helices pack obliquely against the outside of the N36 coiled-coil 
5 trimer in an antiparallel orientation. Interaction between the C34 helices and N36 
occurs mainly through hydrophobic residues in three grooves on the surface of the 
central coiled-coil trimer. The amino acid residues which line these grooves are highly 
conserved between HIV and SIV gp41 . In contrast, the N36 residues which flank the 
C34 helices are divergent. The pattern of sequence conservation is also apparent on the 
10 helical wheel representation of three N36 helices and one C34 helix of Figure 4. (See 
Example 3.) 

Each of the grooves on the surface of the N36 trimer has a particularly deep 
cavity. The cavity is approximately 16 A long, approximately 7 A wide and 
approximately 5-6 A deep. It accommodates three hydrophobic amino acid residues 

15 from the abutting C34 helix: isoleucine-635(I 635 ), tryptophan-63 1 (W 631 ) and 

tryptophan-628 (W 628 ). The top of the N36 cavity is lined by leucine-566 (Leu-566) of 
the left N36 helix and leucine-565 (Leu-565) of the right N36 helix. The left side of the 
cavity is formed by side chains from the left N36 helix, including amino acid residues 
(top to bottom): valine-570 (Val-570), lysine-574 (Lys-574, aliphatic portion) and 

20 glutamine-577 (Gln-577). The right wall of the cavity is formed by amino acid residues 
leucine-568 (Leu-568), tryptophan-571 (Trp-571) and glycine-572 (Gly-572) of the right 
N36 helix. The cavity floor is composed of threonine-569 (Thr-569), isoleucine-573 
(Ile-573) and leucine-576 (Leu-576). Thus, interactions within the cavity are 
predominantly hydrophobic. In addition, aspartic acid-632 (Asp-632) of C34 forms a 

25 conserved salt bridge with lysine-574 (Lys-574) of N36 immediately to the left of the 
cavity. 

As a result of the work described, a region of the HIV-1 transmembrane protein 
gp41 which is a target for HIV inhibitors has been defined and is available for designing 
and/or developing new drugs and identifying existing drugs which inhibit HIV. A 
30 particularly valuable target for an HIV inhibitor are the highly conserved, deep cavities 
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on the N-peptide coiled-coil trimer that accommodate C-peptide amino acid residues. 
The amino acid residues which form the cavity have been defined. Thus, a drug (e.g., a 
peptide, peptidomimetic, small molecule or other agent) which fits into or lines the N- 
peptide cavity or socket, prevents the N-peptide cavity from accommodating peptides 
5 from the C-terminal region of gp41 and, thus, prevents or inhibits gp41 activity, can be 
identified or designed. For example, a drug which fits into or lines the cavity can be 
identified or designed, using known methods. One such drug is a molecule or 
compound which fits into or lines a cavity: 

a) lined by Leu-566 of the left N36 helix and Leu-565 of the right N36 
10 helix; 

b) formed on the left side by sidechains from the left N36 helix, including 
residues (top to bottom) Val-570, Lys-574 (aliphatic portion) and Gln- 
577; 

c) formed on the right side by residues Leu-568, Trp-571 and Gly-572 of . 
15 the right N36 helix; and 

d) composed on its floor of Thr-569, Ile-573 and Leu-576. 

The cavities present on the N-peptide coiled-coil trimer each accommodate three 
hydrophobic amino acid residues from the abutting C34 helix: Ile-635, Trp-631 and 
Trp-628 and a negatively charged amino acid residue from C34: Asp-632, which forms 

20 a conserved salt bridge with Lys-574 of N36 immediately to the left of the cavity. A 
drug which mimics the ability of these three residues (Trp-Trp-Ile) to fit into or line N36 
cavities can also be developed. Such a drug can be developed, for example, with 
reference to the three-dimension coordinates provided (Figures 5 A-5P) and the 
information provided (Figure 6, for example) regarding the distances between the atoms 

25 in the four amino acid residues of C34 that dock into the cavity on the N36 trimer 
surface. 

For example, a structure-based approach can be used, along with available 
computer-based design programs, to identify or design a drug which will fit into, line or 
bind a cavity or pocket on N36 (or block C34 from doing so) and inhibit or prevent the 
30 activity of gp41 and, as a result, reduce (partially or totally) the ability of HIV- 1 to infect 
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cells. In one embodiment of the present invention, the following method is carried out 
to design or identify a molecule or drug which inhibits gp41 activity (and reduces HIV-1 
infection of cells) by fitting into or lining the N36 cavity. In a computer processor 
having a digital processor, a method of designing or identifying a drug or molecule 
5 which inhibits (totally or partially) the interaction of N36 and C34 or fits into or lines a 
cavity on N36, comprises the steps of: (a) providing a library of molecules, compounds 
or drugs whose crystal structures, coordinates, chemical configurations or structures are 
known; (b) providing a crystal structure of a target molecule, which is the a-helical 
domain of the gp41 component of HIV-1 envelope glycoprotein which represents the 

10 core of fusion-active gp41 (referred to for convenience as the N36/C34 complex or 
N36/C34); and (c) comparing coordinates, crystal structure components, chemical 
configurations or structures of members of the library of molecules with those of the 
target molecule, such as by using a processor routine executed by the digital processor 
to search the library to find a molecule or a molecule component which fits into or lines 

15 the cavity on N36, the processor routine providing design or identification of a member 
or members of the library which fit into or line the cavity on N36 or a member or 
members which comprise a component moiety or component moieties which fit into or 
line the cavity on N36. For example, this method can be carried out by comparing the 
members of the library with the crystal structure of gp41 N36/C34 presented herein 

20 using computer programs known to those of skill in the art (e.g., Dock, Kuntz, LD. et 
aL, Science, 257:1078-1082 (1992); Kuntz, LD. et aL,J. Mol Biol, 161:269 (1982); 
Meng, E.C., et al. 9 J. Comp.Chem., 75/505-524 (1992) or CAVEAT). 

In the method, the library of molecules to be searched in (a) can be any library, 
such as a database (i.e., online, offline, internal, external) which comprises crystal 

25 structures, coordinates, chemical configurations or structures of molecules, compounds 
or drugs (referred to collectively as to be assessed or screened for their ability inhibit 
N36/C34 interaction candidate N36 ligands). For example, databases for drug design, 
such as the Cambridge Structural Database (CSD), which includes about 100,000 
molecules whose crystal structures have been determined or the Fine Chemical Director 

30 (FCD) distributed by Molecular Design Limited (San Leandro, CA) can be used. [CSD: 
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Allen, F.H., et aL, Acta Crystallogr. Section B, 55:2331 (1979)] In addition, a library, 
such as a database, biased to include an increased number of members which comprise 
indole rings, hydrophobic moieties and/or negatively-charged molecules can be used. 
Coordinates of the molecules in the library can be compared in the method to 
5 coordinates of the cavity on N36 or to coordinates of C36 and its components which fit 
into or line an N36 cavity or pocket. The cavity on N36 is described in detail herein, as 
are key components of C34 which are accommodated by cavities on the N-peptide. 
Upon finding a match to coordinates of at least one molecule in the library, at least one 
member is, thus, determined or identified as an N36 ligand (at least one member is 

10 determined to be a member which will inhibit N36/C34 interaction). 

Additional steps in the searching process can include combining certain library 
members or components of library members to form collective coordinates or molecules 
which combine features or coordinates of two or more library members; comparing the 
resulting collective coordinates or molecules with the crystal structure of the target 

1 5 molecule and identifying those which will interact with an N36 cavity (or cavities). 
Upon identification of an existing drug or design of a novel molecule as 
described herein, its ability to line or fit into a cavity on N36 or block N36/C34 
interaction can be assessed using known methods, such as by expressing N36 and C34 
in an appropriate host cell (e.g., a bacterial cell containing and expressing DNA 

20 encoding N36 and C34), combining the expressed products with the drug to be assessed 
and determining whether it interferes with the interaction of N36 and C34, lines a cavity 
on N36 and C34. Drugs which are found to do so can be assessed in additional assays, 
both in vitro and in vivo (e.g., an appropriate animal model challenged by HIV 
infection). Once a drug has been identified or designed, it may be desirable to refine or 

25 reconfigure it in such a manner that a drug which binds better (e.g., with greater 
specificity and/or affinity) is produced. In this case, the processor routine further 
determines the quality of matches and calculates a goodness of fit, making it possible to 
do so. 

A drug or molecule which binds or fits into a cavity or pocket on the surface of 
30 N36, can be used alone or in combination with other drugs (as part of a drug cocktail) to 
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prevent or reduce HIV infection of humans. A drug designed or formed by a method 
described herein is also the subject of this invention. 

Also the subject of this invention is a method of treating an individual infected 
with HIV or at risk of being infected with HIV, in order to reduce the extent of infection 
5 or to prevent infection. In the method, a drug which fits into, lines or binds a cavity or 
cavities on N36 is administered to the individual, alone or in combination with other 
drugs. 

A further subject of this invention is an immunogen based on a molecule with 
coordinates as described herein which is used to produce antibodies that bind the N36 

10 cavity or pocket and, thus, prevent N36/C34 interaction and inhibit gp41 activity. For 
example, the N-peptide core can be used, in known methods, to produce polyclonal or 
monoclonal antibodies, which can be administered to an individual. Alternatively, an 
individual (e.g., a human infected with HIV or at risk or being infected) can be 
immunized with the N-peptide core. The individual will, as a result, produce antibodies 

15 which will bind the N36 pocket or cavity and prevent or reduce gp41 activity. Thus, 
this invention also relates to a vaccine to reduce or prevent gp41 function (and, as a 
result, HIV infection). 

As described above, Applicants have provided the identity of amino acid 
residues which form the cavity into which amino acid residues of the gp41 C-peptides 

20 fit. Thus, they have defined target amino acid residues which can be mutated or 
modified, individually or jointly, to further assess the structural basis for interaction 
between the two peptides, identify amino acid residues essential for the two to fit 
together and design or identify molecules or compounds which inhibit/prevent the two 
helices from fitting together and, thus, inhibit or prevent gp41 membrane - fusion 

25 activity. 

Numerous studies have led to the proposal that there are native (nonfusogenic) 
and fusion-active (fiisogenic) states of viral membrane fusion proteins. Extensive 
conformational changes in the HIV envelope complex are thought to be involved in the 
transition from the native to the fusogenic state. Binding of CD4 to gpl20 exposes the 
30 V3 loop of gpl20, which likely interacts with the co-receptors. Choe, H. et al., Cell 
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55:1 135-1 148 (1996); Trkola, A. et al 9 Nature 554:184-187 (1996); Wu, L. et al 9 
Nature 354:179-183 (1996). For some laboratory-adapted isolates of HIV-1, the 
conformational changes in gpl20 upon CD4 binding are sufficient to cause gpl20 to 
physically dissociate or "shed" from the viral surface, leaving the membrane-anchored 
5 gp41 subunit behind. Hart, T. K. et al 9 Proc. Natl. Acad. ScL, USA 55:2189-2193 

(1991); Moore, J. P. et aL, Science 250:1 139-1 142 (1990). Primary isolates of the virus 
generally do not shed gpl20 readily in the presence of CD4 alone, although CD4 
binding still induces conformational changes in gpl20. (Sattentau, Q. J. et al 9 Phil. 
Trans. Royal Soc. B 342:59-66 (1993); Sattentau, Q. J. et aL, J. Virol 67:7383-7393 

10 (1993); Sullivan, N. et aL, J. Virol. 69:4413-4422 (1995), Stamatatos, L. et aL, J. Virol. 
69:6191-6198 (1995)). 

CD4 binding also induces conformational changes in gp41, as inferred from 
changes in antibody binding and sensitivity to limited proteolysis (Sattentau, Q. J. et aL 9 
Phil. Trans. Royal Soc. B 342:59-66 (1993); Sattentau, Q. J. et aL, J. Virol. 

15 67:7383-7393 (1993)). Moreover, addition of low levels of soluble CD4 enhances the 
infectivity of some viral isolates, suggesting that the gpl20/gp41 conformational 
changes induced by CD4 play a role in membrane fusion (Allan, J. S. et al. 9 Science 
247:1084-1088 (1990); Sullivan, N. etal 9 J. Virol. 69:4413-4422 (1995)). These 
conformational changes are thought to expose the hydrophobic, glycine-rich 

20 fusion-peptide region of gp41 that is essential for membrane- fusion activity. 

To obtain a detailed structure for gp41, a protein-dissection approach, in which 
key substructures of a protein are identified and studied was applied. See, for example, 
Oas, T. G. et al 9 Nature 556:42-48 (1988). Limited proteolysis of a fragment 
corresponding to the ectodomain of gp41 generated a stable, soluble complex composed 

25 of two peptide fragments denoted N5 1 and C43 (Figure 1) that are derived from the N- 
and C-terminal regions of the ectodomain, respectively (Lu, M. et al. 9 Nature Struct 
Biol. 2:1075-1082 (1995)). In gp41, the region following the fusion peptide has a high 
a -helical propensity and a 4-3 heptad repeat of hydrophobic residues, a sequence feature 
characteristic of coiled coils. Chambers, P. et aL, J. Gen. Virol 77:3075-3080(1990); 

30 Delwart, E. L. et aL, AIDS Res. Hum. Retroviruses 6:703-706 (1990); Gallaher, W. R. et 
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al, AIDS Res. Hum. Retroviruses 5:431-440 (1989). The N51 peptide corresponds to 
the 4-3 hydrophobic repeat region adjacent to the fusion peptide, while the C43 peptide 
is derived from the region prior to the transmembrane segment (Figure 1). 

Interestingly, isolated peptides that overlap, or are derived from, the N51 and 
5 C43 regions of gp41 can have potent anti-viral activity (Wild, C. T. et al, Proc. Natl. 
Acad. Sci., USA 59:10537-10541 (1992); Wild, C. T. et al, Proc. Natl. Acad. ScL, USA 
91:9110-911 A (1994); Jiang, S. et al, Nature 365:1 13 (1993)). Peptides from the 
C-terminal region of the ectodomain have the highest activity. Consistent with these 
studies, both N51 and C43 are capable of inhibiting HIV envelope-mediated cell fusion; 

10 the C43 peptide exhibits 10-fold greater activity than N5 1 (Lu, M. et al, Nature Struct. 
Biol 2:1075-1082 (1995)). The inhibitory activity of the C43 peptide, however, is 
markedly reduced when stoichiometric amounts of N51 are present, suggesting that the 
C43 peptide inhibits membrane fusion in a dominant-negative manner, by associating 
with an N51 region within intact gp41 (Lu, M. et al, Nature Struct. Biol. 2:1075-1082 

15 (1995)). Thus, in addition to providing insights into the mechanism of membrane 

fusion, determining the structural basis for interaction between the N51 and C43 regions 
will assist anti-viral drug-development efforts. 

Biophysical studies showed that the N51 and C43 peptides associate to form a 
highly thermostable, helical, trimeric complex of heterodimers, in which the N51 and 

20 C43 helices are oriented in an antiparallel manner. Lu, M. et al, Nature Struct. Biol. 
2:1075-1082 (1995). Analogous experiments with the gp41 ectodomain from simian 
immunodeficiency virus (SIV) gave almost identical results, indicating that the gp41 
core identified in these protein-dissection studies is conserved among lentiviruses. 
Blacklow, S. C. et al, Biochemistry 34:14955-14962 (1995). On the basis of these 

25 results and other considerations, we proposed that the gp41 core consists of an interior 
coiled-coil trimer formed by the N5 1 region, against which three C43 helices pack. Lu, 
M. et al, Nature Struct. Biol 2:1075-1082 (1995); Blacklow, S. C. et al, Biochemistry 
34:14955-14962 (1995). 

The thermal denaturation of the N51/C43 complexes from HIV-1 or SIV gp41 is 

30 irreversible, probably as a result of aggregation of the unfolded peptides at high 
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temperature. Lu, M. et al, Nature Struct. Biol 2:1075-1082 (1995); Blacklow, S. C. et 
al, Biochemistry 34:14955-14962 (1995). With a view towards crystallographic 
studies, further protein dissection experiments were used to define a smaller subdomain 
with more favorable thermodynamic properties. These studies led to the identification 
5 of the peptides N36 and C34 (Figure 1). Like the longer peptides, N36 and C34 form a 
stable, trimeric complex of heterodimers with 100% a-helix content . Unlike the larger 
complex, however, the N36/C34 complex has a reversible thermal unfolding transition. 
Presented herein is the crystal structure of the N36/C34 complex solved to 2.0 A 
resolution, as well as a discussion of the implications of this structure for HIV viral 

10 membrane fusion and its inhibition. 

The work described herein provides good evidence that the structure of gp41 
obtained is found in the fusion-active state of HIV envelope. That this is the core of 
gp41 in the fusogenic state is supported by several considerations. 

First, the N36/C34 complex folds in the absence of gpl20. The fusogenic state 

15 of gp41 is expected to be stable in the absence of gpl20, since dissociation of gpl20 
from the envelope glycoprotein is thought to accompany the conversion from a native to 
a fusogenic state. Cohen, J., Science 274:502 (1996); Wilkinson, D., Current Biology 
6: 105 1-1053, (1996). Similarly, the conversion of influenza HA2 to the fusogenic state 
is accompanied by loss of most of its contacts with HA1. Proteolysis of the low-pH 

20 converted form of HA prior to crystallization removes most of the receptor-binding 

HA1 subunit. Bullough, P. A. et al, Nature 377:37-43 (1994). Moreover, the structural 
features of the fusogenic state are preserved in fragments of HA2 that fold cooperatively 
in the complete absence of the HA1 subunit. Carr, C. M. et al 9 Cell 73:823-832 (1993); 
Chen, J. et al, Proc. Natl Acad. Set, USA 92: 12205-1 2209 (1995). 

25 Second, the isolated gp41 core is exceedingly stable to thermal denaturation. 

The N51/C43 complex has an apparent melting temperature of approximately 90°C. 
Lu, M. et ai, Nature Struct. Biol 2:1075-1082 (1995). In contrast, the native state of 
the HIV envelope glycoprotein is not particularly stable, as evidenced by the ease with 
which gpl20 is shed in preparations of virus particles. Helseth, E. et al,J. Virol 
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65:21 19-2123 (1991); Kalyanaraman, V. S. et al, AIDS Res. Hum. Retroviruses 6, 
371-380(1990).. 

Third, mutations in gp41 that abolish infectivity and membrane fusion often map 
to residues that are expected to stabilize the gp41 core structure determined here. 
5 Numerous studies show that mutations in the 4-3 hydrophobic repeat region abolish 
membrane fusion, although these mutants tend to have additional defects. Dubay, J. W. 
et al, J. Virol. 66:4748-4756 (1992); Chen, S. S., J. Virol. 65:2002-2010 (1994); Chen, 
S. S. et al. 9 J. Virol. 67, 3615-3619 (1993); Wild, C. et al, Proc. Natl Acad. Sci., USA 
P7:12676-12680(1994);Poumbourios,P.,y. Virol 77:2041-2049(1997). The 

10 Leu-568-Ala, Trp-571-Arg, and Asn-656-Leu mutations are particularly noteworthy 
because cells expressing mutant envelope glycoproteins with one of these point 
mutations are completely defective in membrane fusion, as judged by an inability to 
form syncytia with CD4-positive human lymphocyte lines, even though the mutant 
proteins exhibit substantial cell-surface expression, CD4 binding, gpl20/gp41 

15 association, gpl60 precursor processing, and soluble CD4-induced shedding. Cao, J. et 
al, J. Virol 67:2747-2755 (1993). Leu-568 and Trp-571 are N36 residues that line the 
right wall of the cavity. Asn-656 is in an a position of the C34 peptide and packs 
against the central N36 coiled-coil trimer. The locations of these key mutations suggest 
that interactions between the N36 and C34 helices are critical for membrane fusion. 

20 Fourth, that the N36/C34 structure corresponds to the core of the fusogenic state 

of gp41 is consistent with a large body of data on the inhibition of HIV- 1 infection and 
syncytia formation by derivatives of the peptides that make up this core. This issue is 
discussed in more detail below. Finally, the structural similarity of the N36/C34 
complex to the low-pH induced conformation of influenza HA2 (Bullough, P. A. et al, 

25 Nature 371:37-43 (1994)) and to the structure of Mo-MLV TM (Fass, D. et al, Nature 
Struct. Biol 5:465-469 (1996)), each of which has been proposed to represent 
fusion-active conformations, supports the idea that N36/C34 is the core of the fusogenic 
conformation of gp41 . For all three structures, the hydrophobic fusion peptide would be 
immediately amino terminal to a central, three-stranded coiled coil. In influenza HA2 

30 and HIV-1 gp41, the central three-stranded coiled coils are each stabilized by three 
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helices that pack obliquely against the coiled-coil trimer in an antiparallel manner. In 
the TM subunit of Mo-MLV, these obliquely packed helices are replaced by a short 
helix and an extended region that serve a similar structural role. 

Work described herein also relates to inhibitors of HIV- 1 infection and targets 
5 for developing new peptidomimetic or small-molecule inhibitors of HIV infection. 
Synthetic peptides containing approximately 40 residues from gp41 that overlap, or 
include all of, the residues in N36 or C34 can be effective inhibitors, at micromolar to 
nanomolar concentrations, of HIV infection and syncytia formation. Lu, M. et al. 9 
Nature Struct Biol. 2:1075-1082 (1995); Jiang, S. et ai 9 Nature 365:113 (1993); Wild, 

10 C. T. et al, Proa Natl. Acad. ScL, USA 59:10537-10541 (1992); Wild, C. T. et al, 
Proc. Natl. Acad. Scl USA, 9 7:9770-977 r 4 (1994). Assessment previously of the 
inhibitory properties of the N51 and C43 peptides implied that these peptides work in a 
dominant negative manner (Herskowitz, L, Nature 329:219-222 (1987)) by binding to 
viral gp41 (Lu, M. et al, Nature Struct. Biol. 2:1075-1082 (1995)), a conclusion that 

15 was also reached through studies of a gp41 ectodomain chimeric protein (Chen, C. H. et 
al, J. Virol. 69:311 1-3777 (1995)). Further evidence in support of a dominant-negative 
mechanism is provided by the finding that mutations in C-peptide derivatives that 
disrupt their interactions with N-peptide correlate with decreased potency as inhibitors. 
Wild, C. et al, AIDS Res. Hum. Retroviruses 77:323-325 (1995). 

20 The gp41 core crystal structure is fully consistent with this dominant-negative 

mechanism of inhibition (Figure 3). The C-peptide derivatives could act as 
dominant-negative inhibitors by binding to the endogenous N-peptide coiled-coil trimer 
within viral gp41 . The N-peptides might inhibit fusion by interfering with formation of 
the central, coiled-coil trimer within viral gp41, and/or by binding to endogenous viral 

25 C-peptide regions. 

Both the N- and C-peptide classes of inhibitors are effective against a wide 
range of HIV strains, including laboratory-adapted strains and primary isolates. Wild, 
C. T. et al, Proc. Natl. Acad. ScL, USA 59:10537-10541 (1992); Jiang, S. et al, Nature 
365:113 (1993); Wild, C. T. et al, Proc. Natl Acad. Set, USA 91:9110-911 A (1994). In 

30 contrast, soluble CD4 and many neutralizing antibodies are typically effective only on a 
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limited subset of HIV strains (e.g., Daar, E. S. et al, Proc. Natl. Acad. Sci., USA 
57:6574-6578 (1990); Palker, T. J. et al 9 Proc. Natl. Acad. Sci., USA 55:1932-1936 
(1988); Nara, P. L. et al, J. Virol 62:2622-2628 (1988); Moore, J. P. et al 9 J. Virology 
69: 101-109 (1995). There is a striking conservation of residues involved in interactions 
5 between the N-peptide and C-peptide, comparing gp41 from HIV-1 and SIV. The broad 
neutralizing effects of the N- and C-peptides derive from the strong sequence 
conservation of these residues. 

The highly conserved, deep cavities on the N-peptide coiled-coil trimer that 
accommodate conserved C-peptide residues are useful targets for the development of 

10 new peptidomimetic or small-molecule inhibitors of HIV infection. The two indole 
rings and neighboring sidechains that occupy the prominent cavity in N36 are a 
particularly attractive target for the design and/or development of new drugs or 
identification of existing drugs which inhibit HIV infection. Not only is this cavity deep 
and highly conserved, but two of the three key mutations that disrupt membrane fusion, 

15 discussed above, map to one wall of this cavity. Because some of the known potent 
peptide inhibitors (Wild, C. T. et aL 9 Proc. Natl. Acad. Sci. f USA 91:9110-911 A (1994)) 
extend beyond N36 and C34 and do not involve this cavity region, it is likely that other 
distinctive surface features exist in the interface between the N- and C- helices of 
longer peptides such as N51 and C43. Lu, M. et al, Nature Struct. Biol 2:1075-1082 

20 (1995). The importance of identifying drugs that target the HIV membrane-fusion 

machinery is emphasized by the success of combination drug regimens for the treatment 
of AIDS. As yet, these combination therapies do not target the HIV envelope. A 
method of identifying a drug which is an inhibitor of N36/C34 peptide interaction (and, 
thus, is an inhibitor of the HIV membrane fusion machinery and, as a result, reduces or 

25 prevents HIV entry into (infection of) cells is the subject of this invention. In the 
method, N36 and C34 are combined with a drug to be assessed, under conditions 
suitable for N36 and C34 to interact (suitable for cavities on the N-peptide coiled-coil 
trimer to accommodate C-peptide amino acid residues). The resulting combination is 
maintained under these conditions for sufficient time to permit N36 and C34 to interact 

30 (e.g., for sufficient time for N36 and C34 to interact in the absence of the drug being 
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assessed). Whether interaction occurs and/or the extent to which N36 and , C34 interact 
is assessed, using known methods. If N36 and C34 do not interact or interact to a lesser 
extent in the presence of the drug being assessed than in the absence of the drug, the 
drug to be assessed is an inhibitor of N3 6/34 interaction. Such a drug is an inhibitor of 
5 the HIV membrane fusion machinery. Such an inhibitor can be further assessed, using 
in vitro or in vivo methods, for its ability to reduce or prevent HIV entry into cells. 

Results of the work described have implications for gp41 function and viral 
membrane fusion. The structures of the cores of the membrane-fusion subunits from 
HIV, Mo-MLV and influenza virus are remarkably similar. It appears that these diverse 

10 viruses present fusion peptides to target cells via a common scaffold, in which the 
fusion peptides are atop a central, three-stranded coiled coil that is supported by 
additional, carboxy-terminal structures. This scaffold is likely to be a common feature 
of viral membrane-fusion proteins since many of these proteins contain coiled-coil 
signature sequences, with 4-3 heptad repeats of hydrophobic amino acids, adjacent to an 

15 amino-terminal fusion-peptide region. Delwart, E. L. et aL, AIDS Res. Hum. 
Retroviruses 6:703-706 (1990); Chambers, P. et aL, J. Gen. Virol 77:3075-3080 
(1990); Gallaher, W. R. et aL, AIDS Res. Hum. Retroviruses 5:431-440 (1989). 
Moreover, studies of the fusion proteins of several paramyoviruses have identified 
regions with similarity to the N- and C- peptide regions of HIV and SIV gp41 (Lambert, 

20 D. M. et aL, Proc. Natl. Acad. ScL, USA 93:2186-2191 (1996)). These common 

structural features suggest that the rich body of work investigating the mechanism of 
membrane fusion for many other viruses, including influenza, is relevant for 
understanding the mechanism of HIV-mediated membrane fusion. 

Given the similarity in structure between the HIV gp41 core and the low-pH 

25 converted conformation of HA2, it is worth considering whether the structural 
rearrangements that occur during the transition of HA2 to the fuso genie state are 
analogous to those in gp41 . In the native, non-fliso genie conformation of influenza HA, 
part of the N-terminal coiled-coil trimer seen in the fusogenic state (Bullough, P. A. et 
aL, Nature 371:37-43 (1994)) is held in a non-helical, hairpin structure, as a result of 

30 extensive interactions with the receptor-binding HA 1 subunit (Wilson, I. A. et aL, 
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Nature 289:366-373 (1981)). Thus, the receptor-binding HA1 subunit acts as a "clamp" 
that binds this N-terminal region of HA2, holding it in the non-coiled coil conformation. 
The receptor-binding domains dissociate in the fusogenic conformation of HA, as in 
HIV, although in the case of influenza, the HA1 subunits are still tethered via a disulfide 
5 bond to HA2. Upon release of the HA1 clamp, a dramatic conformational change in 
HA2 occurs, including coiled-coil formation by this N-terminal region (Bullough, P. A. 
et al., Nature 577:37-43 (1994); Carr, C. M. et al. 9 Cell 73:823-832 (1993)). 

A substantial conformational change in the envelope glycoprotein complex also 
appears to be critical during HIV infection, although few details are understood. It 

10 remains to be determined whether the HIV envelope complex also utilizes coiled-coil 
formation as part of a spring-loaded mechanism, or if the gp41 core structure 
determined here is present in the native as well as the fusogenic state. It is possible that 
the N36/C34 structure is the core structure of gp41 even when it is bound to gpl20, and 
that release of gpl20 simply exposes the fusion-peptide region of gp41. Alternatively, 

15 HIV gpl20, like influenza HA1, may serve as a clamp that represses formation of the 
N36/C34 structure presented here, with gpl20 shedding allowing its formation. This 
gp41 core structure serves as the starting point for addressing this and other essential 
structural questions about the mechanism of HIV entry into cells. 

The present invention is illustrated by the following examples, which are not 

20 intended to be limiting in any way. 

EXAMPLES 

The materials and methods described below were used in the examples which 

follow. 

MATERIALS AND METHODS 
25 PEPTIDE PURIFICATION AND CRYSTALLIZATION 

Peptides N36 and C34 were synthesized by standard FMOC peptide chemistry 
and have an acetylated N-terminus and a C-terminal amide. N36 corresponds to 
residues 546 to 581 of gpl60, while C34 corresponds to residues 628 to 661. After 
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cleavage from the resin, the peptides were desalted on a Sephadex G-25 column 
(Pharmacia) and lyophilized. Peptides were then purified by reverse-phase high 
performance liquid chromatography (Waters, Inc.) on a Vydac CI 8 preparative column. 
The identity of the peptides was verified by mass spectrometry. Peptide concentration 
5 was determined by tyrosine and tryptophan absorbance in 6 M GuHCl. Edelhoch, H., 
Biochemistry 5:1948-1954 (1967). 

To grow crystals, a 10 mg/ml stock of the N36/C34 complex was diluted 1:1 in a 
sitting drop with 80 mM NH 4 C1, 20% PEG200, and 50% isopropanol and allowed to 
equilibrate against a reservoir of 80 mM NH 4 C1, 20% PEG200, and 30% isopropanol. 
10 Crystals grew as hexagonal prisms and belonged to the space group P321 (a=b=49.5 A, 
c=55.3 A). For native data sets and heavy atom screens, crystals were flash-frozen in a 
MSC cryogenic crystal cooler (X-stream), and data was collected on a Rigaku RU-200 
rotating-anode X-ray generator with an R-axis He detector. 

HEAVY ATOM SCREEN AND PHASE DETERMINATION 
15 Multiwavelength anomalous diffraction (MAD) data were collected at the 

Howard Hughes Medical Institute beamline X4A of the National Synchrotron Light 
Source at Brookhaven National Laboratory. Fluorescence spectra (1.1459 to 1.1354 A) 
were obtained from a single flash- frozen crystal soaked in 0.04% Os0 4 in harvest buffer 
(80 mM NH 4 C1, 20% PEG200, 30% isopropanol) for 4 hours. Based on the 
20 fluorescence profile, individual data sets were collected on Fuji imaging plates at four 
wavelengths (1,=1.1396 A, 1 2 =1.1398 A, 1 3 =1.1402 A, and 1 4 =1.1344 A). Reflections 
were integrated and scaled with DENZO and SCALEPACK. (Otwinowski, Z., 
Dares bury Study Weekend Proceedings, 1993.) 

Data merging, phase determination and map generation were all performed using 
25 the CCP4 suite of programs. CCP4, Acta Cryst. £50:760-763 (1994). Anomalous and 
dispersive difference Patterson maps from MAD data sets all showed a single clear peak 
corresponding to the osmium binding site. The position of the site was calculated from 
the single z=0 Harker section and from cross peaks found at z=0.28 and z=0.71. Phases 
generated with the program MLPHARE (Otwinowski, Z., Daresbury Study Weekend 
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Proceedings, 1991) gave an overall figure of merit of 0.89 (Table) and produced an 
interpretable electron density map with a clear solvent boundary. Density modification 
was subsequently performed using DM (Cowtan, K. D., Newsletter on Protein 
Crystallography 57:34-38 (1994)), resulting in maps of high quality in which electron 
5 density for the entire main chain and all side chains was evident. 

MODEL REFINEMENT 

The polypeptide chain was traced and the side chains readily positioned into a 
2.7 A density-modified map using the program O (Jones, T. A., and Kjeldgaard, M., O 
- The Manual, Uppsala, Sweden: http://kaktus.kemi.aau.dk, 1992). The initial model 

10 of N36/C34 was refined with the program XPLOR (Briinger, A. T., A system for X-ray 
crystallography and NMR. X-PLOR Version 3.1, Yale University Press, New Haven, 
Connecticut, 1992) against data to 2.0 A from a native crystal. An anisotropic B-factor 
was applied to the native structure factors using XPLOR, and a free R set (Briinger, A. 
T., Nature 355:472-475 (1992)) was taken from the data prior to refinement (Table). 

1 5 The model was refined by iterative cycles of grouped B-factor, positional, and 

individual B-factor refinement. As the refinement proceeded, 43 waters were added and 
a bulk solvent correction was applied. At no time during the refinement did the 
molecule differ enough from the original model so as to require manual rebuilding, 
though main chain and side chain geometries were optimized in O between cycles of 

20 refinement. The quality of the structure was verified by PROCHECK (Laskowski, R. 
A. et al., J. Appl Cryst. 25:283-291 (1993)), with all residues but one (Ile-580) 
occupying most-preferred regions of Ramachandran space. Ile-580 lies in the 
additionally allowed region of Ramachandran space and is the second residue from the 
C-terminus of the N36 peptide; inspection of the solvent-flattened MAD-phased maps 

25 confirmed its position. 



EXAMPLE 1 PRODUCTION OF CRYSTALS OF N36/C34 

Crystals of N36/C34 were grown by sitting-drop vapor diffusion (see Methods). 
An initial model of the complex was built into an electron density map generated by 
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multi-wavelength anomalous dispersion (MAD) analysis (Hendrickson, W. A., Science 
254:51-5$ (1991)) of an osmium-derivatized crystal. Details of data collection and 
MAD phasing statistics are listed in the Table. A representative portion of the 
solvent- flattened electron density map used for building the initial model is shown in 
5 Figure 2. The structure was refined against data to 2.0 A from a native crystal to yield 
an R free of 0.266 and an R cryst of 0.238 (Table). 
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EXAMPLE 2 ASSESSMENT OF THE STRUCTURE OF THE N36/C34 COMPLEX 
The N36/C34 complex is a six-stranded helical bundle (Figure 3). The center of 
this bundle consists of a parallel, trimeric coiled coil of three N36 helices wrapped in a 
gradual left-handed superhelix. Three C34 helices wrap antiparallel to the N36 helices 
5 in a left-handed direction around the outside of the central coiled-coil trimer. The 
complex is a cylinder measuring -35 A in diameter and -55 A in height. 

As in other naturally-occurring coiled coils (Cohen, C. et aL, Proteins 7:1-15 
(1990)), the interior residues at the a and d positions of the N36 heptad repeat are 
predominantly hydrophobic, although occasional buried polar interactions are also 

10 present in the central three-stranded coiled coil (Figure 4). A sequence comparison of 
HIV-1 (HXB2 strain) and SIV (Mac239 strain) gp41 shows that the residues at these 
two heptad-repeat positions are highly conserved (Figure 4). The characteristic 
"knobs-into-holes" packing of coiled coils is utilized, whereby the residues (knobs) at 
the a and d layers pack into cavities (holes) between four residues of an adjacent helix 

15 (Crick, F. H. C, Acta Cryst. 6:689-697 (1953); O'Shea, E. K. et aL, Science 
254:539-544 (1991)). Of the three types of knobs-into-holes packing geometry 
observed in coiled-coil structures (Harbury, P. B. et aL, Science 262:1401-1407 (1993); 
Harbury, P. et aL, Nature 377:80-83 (1994)), the N36 trimer demonstrates exclusively 
"acute" packing geometry, similar to that found in the crystal structure of an 

20 isoleucine-zipper trimer (Harbury, P. et aL, Nature 577:80-83 (1994)). This type of 
packing arrangement in the interior of the coiled coil is characteristic of trimers because 
it allows P branched residues (e.g., isoleucine) to pack favorably at both the a and d 
positions (Harbury, P. et aL, Nature 377:80-83 (1994)). Trimeric coiled coils, like the 
N36 trimer (Figure 4), tend to have p branched residues at both the a and d positions. 

25 Although complexes of the N- and C- peptides are clearly trimeric (Lu, M. et 

aL, Nature Struct Biol 2:1075-1082 (1995); Blacklow, S. C. et aL, Biochemistry 
34:14955-14962 (1995)), isolated N-peptides corresponding to the 4-3 hydrophobic 
repeat from gp41 have been reported to form tetramers, leading to conflicting 
conclusions regarding the oligomeric state of gp41 (Lawless, M. et aL, Biochemistry 

30 35: 13697-13708 (1996); Rabenstein, M. et aL, Biochemistry 34:13390-13397 (1995); 
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Rabenstein, M. D. et aL, Biochemistry 55:13922-13928 (1996); Shugars, D. C. et al.,J. 
Virol. 70:2982-2991 (1996)). An electrostatic potential map of the N3 6 coiled-coil 
trimer shows that its surface is largely uncharged. The grooves that are the sites for C34 
interaction are lined with predominantly hydrophobic residues (see below) that would 
5 be expected to lead to aggregation upon exposure to solvent. Indeed, previous studies 
have shown that the isolated N-peptides tend to aggregate (Blacklow, S. C. et aL, 
Biochemistry 14955-14962 (1995); Lu, M. et aL, Nature Struct. Biol. 2:1075-1082 
(1995)). Thus, conclusions regarding the oligomerization state of gp41 based on studies 
of isolated N-peptides are probably misleading. The N36/C34 complex shows a much 
10 more highly charged surface due to acidic residues on the outside of the C34 helices, 
explaining the greater solubility of the heterodimeric complex. 

EXAMPLE 3 DETERMINATION OF INTERACTIONS BETWEEN THE - AND C- 
PEPTIDE HELICES 
Three C34 helices pack obliquely against the outside of the N36 coiled-coil 

15 trimer in an antiparallel orientation. These C34 helices interact with N36 mainly 

through hydrophobic residues in three grooves on the surface of the central coiled-coil 
trimer. Sequence comparisons between HIV and SIV gp41 shows that the residues 
lining these grooves are highly conserved. In contrast, the N36 residues flanking the 
C34 helices are divergent between HIV and SIV. 

20 This pattern of sequence conservation is also apparent on a helical wheel 

representation of three N36 helices and one C34 helix (Figure 4). In this diagram, the 
residue positions in C34 are depicted as ellipses to indicate the oblique tilt of the C34 
helix relative to the N36 superhelix and to emphasize that C34 is not part of a coiled 
coil. Residues at the e and g positions of the N36 helices lie on the outside of the 

25 central coiled coil and point into the triangular interhelical space between two N36 
helices and a buttressing C34 helix. In general, residues at positions a and d of C34 
pack against residues at the e and g positions of the N36 helices (Figure 4), although 
contacts at other positions are often observed. Comparing HIV and SIV gp41, no 
nonconservative changes exist at the e and g positions of the N36 helix, and only two 



0399.1167-007 

-27- 

such changes occur at the a and d positions of C34. In contrast, 8 of the 9 
nonconservative changes in the N36 helix occur at the outside f, b, and c positions, 
while 13 of the 15 nonconservative changes in the C34 helix occur at positions other 
than a and d. The sequence of the N-peptide region of gp41 is among the most highly 
5 conserved within the HIV envelope glycoprotein. Our results show that the high 
sequence conservation in this region results from selective pressure on the e and g 
positions to retain C34 peptide interactions, as well as pressure on the a and d positions 
to maintain trimeric coiled-coil interactions. 

Each of the grooves on the surface of the N36 trimer has a particularly deep 

10 cavity. This cavity is large (~ 16 A long, ~7 A wide, and 5-6 A deep) and 

accommodates three hydrophobic residues from the abutting C34 helix: Ile-635, 
Trp-631 and Trp-628. The top of the cavity is lined by Leu-566 of the left N36 helix 
and Leu-565 of the right N36 helix. Side chains from the left N36 helix form the left 
side of the cavity, including residues (top to bottom) Val-570, Lys-574 (aliphatic 

15 portion), and Gln-577. The right wall is formed by residues Leu-568, Trp-571, and 
Gly-572 of the right N36 helix. The floor of the cavity is composed of Thr-569 and 
Leu-576 of the right N36 helix, and also Ile-573 of both N36 helices. With the 
exception of Ile-573 (which is replaced by Thr), all the residues forming the cavity are 
identical between HIV-1 and SIV. In addition to these predominately hydrophobic 

20 interactions within the cavity, Asp-632 of C34 forms a conserved salt bridge with 
Lys-574 of N36 immediately to the left of the cavity. 

EXAMPLE 4 COMPARISON OF THE STRUCTURE OF THE N36/C34 COMPLEX 
WITH THE LOW-pH INDUCED CONFORMATION OF HA 
The N36/C34 complex shows striking structural similarity to the low-pH 
25 induced conformation of the influenza HA 2 subunit (TBHA 2 ) (Bullough, P. A. et al, 
Nature 377:37-43 (1994)) and to the TM subunit of Mo-MLV (Fass, D. et al, Nature 
Struct. Biol 3:465-469 (1996)), each of which has been proposed to be a fusogenic 
conformation. Remarkably, the core of each of the three structures contains a 
three-stranded coiled coil that would be adjacent to the amino-terminal fusion peptide. 



» 0399.1167-007 



-28- 



The trimeric coiled coil of gp41 is very similar to that of the Mo-MLV TM, both having 
a similar superhelical pitch (-175 A) and a regular 4-3 periodicity. In contrast, the 
TBHA 2 coiled coil is atypical because it contains two regions with skips in the 4-3 
periodicity, resulting in an underwound superhelix (pitch of 300-400 A). As in the gp41 
5 core structure, TBHA 2 contains three antiparallel helices that are packed, with a 
left-handed tilt, against the central trimeric coiled coil. 

EQUIVALENTS 

While this invention has been particularly shown and described with references 
to preferred embodiments thereof, it will be understood by those skilled in the art that 
10 various changes in form and details may be made therein without departing from the 
spirit and scope of the invention as defined by the appended claims. Those skilled in the 
art will recognize or be able to ascertain using no more than routine experimentation, 
many equivalents to the specific embodiments of the invention described specifically 
herein. Such equivalents are intended to be encompassed in the scope of the claims. 



